-
Language models like ChatGPT are generally good at answering questions (e.g., "What is 12 * 12?"), but we show they can surprisingly struggle when asked to do the reverse task: generating questions for given answers (e.g., "Give me a question with the answer 144"). We study when these errors happen, what might be causing them, and how they can be addressed.
Free, publicly-accessible full text available January 1, 2026.
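One way to make this failure mode concrete is a round-trip check: ask the model to generate a question for a target answer, then feed that question back and see whether the original answer comes out. Below is a minimal Python sketch of such a probe; the `ask` hook, prompt wording, and substring check are illustrative assumptions, not the paper's setup.

```python
# A minimal round-trip probe, assuming a generic `ask(prompt) -> str`
# hook into whatever chat model is under test. The prompts and the
# substring comparison are assumptions, not the paper's methodology.

def forward_prompt(question: str) -> str:
    return f"Answer concisely: {question}"

def reverse_prompt(answer: str) -> str:
    return f"Write a question whose answer is exactly: {answer}"

def reverse_round_trip_ok(ask, answer: str) -> bool:
    """Generate a question for `answer`, answer that question, compare."""
    question = ask(reverse_prompt(answer))
    round_trip = ask(forward_prompt(question))
    return answer.strip().lower() in round_trip.strip().lower()

# Toy stand-in model, just to show the flow end to end:
def toy_ask(prompt: str) -> str:
    return "What is 12 * 12?" if "144" in prompt else "144"

assert reverse_round_trip_ok(toy_ask, "144")
```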
-
Incorporating Taxonomic Reasoning and Regulatory Knowledge into Automated Privacy Question Answering
Privacy policies are often lengthy and complex legal documents, and are difficult for many people to read and comprehend. Recent research efforts have explored automated assistants that process the language in policies and answer people's privacy questions. This study documents the importance of two different types of reasoning necessary to generate accurate answers to people's privacy questions. The first is the need to support taxonomic reasoning about related terms commonly found in privacy policies. The second is the need to reason about regulatory disclosure requirements, given the prevalence of silence in privacy policy texts. Specifically, we report on a study involving the collection of 749 sets of expert annotations to answer privacy questions in the context of 210 different policy/question pairs. The study highlights the importance of taxonomic reasoning and of reasoning about regulatory disclosure requirements when it comes to accurately answering everyday privacy questions. Next, we explore to what extent current generative AI tools are able to reliably handle this type of reasoning. Our results suggest that, in their current form and in the absence of additional help, current models cannot reliably support the type of reasoning about regulatory disclosure requirements necessary to accurately answer privacy questions. We proceed to introduce and evaluate different approaches to improving their performance. Through this work, we aim to provide a richer understanding of the capabilities automated systems need to have to provide accurate answers to everyday privacy questions and, in the process, outline paths for adapting AI models for this purpose.
Free, publicly-accessible full text available November 29, 2025.
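To make the taxonomic-reasoning requirement concrete, the sketch below matches a question term against disclosures made at broader levels of a term hierarchy: a question about a narrow term should be answerable from a disclosure about one of its ancestors, and when no ancestor is disclosed the policy is silent, which is exactly where reasoning about regulatory disclosure requirements has to take over. The toy taxonomy and term names are invented for illustration and are not the study's annotation scheme.

```python
# Illustrative hypernym matching over privacy terms. The tiny taxonomy
# below is an invented example, not the study's annotation scheme.

PARENT = {
    "email address": "contact information",
    "phone number": "contact information",
    "gps position": "location data",
    "contact information": "personal information",
    "location data": "personal information",
}

def ancestors(term):
    """Yield the term itself and every broader category above it."""
    while term is not None:
        yield term
        term = PARENT.get(term)

def policy_covers(question_term, disclosed_terms):
    """A disclosure about a broader category covers a narrower query term."""
    return any(t in disclosed_terms for t in ancestors(question_term))

disclosed = {"contact information"}
print(policy_covers("email address", disclosed))  # True via hypernym match
# False: the policy is silent here, and what silence means is where
# knowledge of regulatory disclosure requirements comes in.
print(policy_covers("gps position", disclosed))
```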
-
Understanding and managing data privacy in the digital world can be challenging for sighted users, let alone blind and low-vision (BLV) users. There is limited research on how BLV users, who have special accessibility needs, navigate data privacy, and how potential privacy tools could assist them. We conducted an in-depth qualitative study with 21 US BLV participants to understand their data privacy risk perception and mitigation, as well as their information behaviors related to data privacy. We also explored BLV users' attitudes towards potential privacy question answering (Q&A) assistants that enable them to better navigate data privacy information. We found that BLV users face heightened security and privacy risks, but their risk mitigation is often insufficient. They do not necessarily seek data privacy information but clearly recognize the benefits of a potential privacy Q&A assistant. They also expect privacy Q&A assistants to possess cross-platform compatibility, support multi-modality, and demonstrate robust functionality. Our study sheds light on BLV users' expectations when it comes to usability, accessibility, trust, and equity issues regarding digital data privacy.
-
Decomposable tasks are complex and comprise a hierarchy of sub-tasks. Spoken intent prediction, for example, combines automatic speech recognition and natural language understanding. Existing benchmarks, however, typically hold out examples for only the surface-level sub-task. As a result, models with similar performance on these benchmarks may have unobserved performance differences on the other sub-tasks. To allow insightful comparisons between competitive end-to-end architectures, we propose a framework to construct robust test sets using coordinate ascent over sub-task-specific utility functions. Given a dataset for a decomposable task, our method optimally creates a test set for each sub-task to individually assess sub-components of the end-to-end model. Using spoken language understanding as a case study, we generate new splits for the Fluent Speech Commands and Snips SmartLights datasets. Each split has two test sets: one with held-out utterances assessing natural language understanding abilities, and one with held-out speakers to test speech processing skills. Our splits identify performance gaps of up to 10% between end-to-end systems that were within 1% of each other on the original test sets. These performance gaps allow more realistic and actionable comparisons between different architectures, driving future model development. We release our splits and tools for the community.
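The split-construction idea can be sketched as coordinate ascent: toggle one example in or out of the test set at a time and keep the move whenever a sub-task utility improves. The toy utility below (a test set near 20% of the data with no speaker leakage into training) is an assumption for illustration, not the paper's actual objective functions or datasets.

```python
# Simplified coordinate-ascent split search under an assumed toy utility;
# the paper's actual objectives, constraints, and datasets differ.
import random

def utility(test_ids, examples, key):
    """Toy utility: reward a test set near 20% of the data whose `key`
    values (e.g., speaker) never leak into the training side."""
    test_keys = {examples[i][key] for i in test_ids}
    train_keys = {examples[i][key]
                  for i in range(len(examples)) if i not in test_ids}
    leakage = len(test_keys & train_keys)
    size_gap = abs(len(test_ids) / len(examples) - 0.2)
    return -leakage - size_gap

def coordinate_ascent(examples, key, iters=2000, seed=0):
    rng = random.Random(seed)
    test_ids = set(rng.sample(range(len(examples)), len(examples) // 5))
    best = utility(test_ids, examples, key)
    for _ in range(iters):
        i = rng.randrange(len(examples))   # flip one coordinate at a time
        test_ids ^= {i}
        score = utility(test_ids, examples, key)
        if score >= best:
            best = score                   # keep improving (or equal) moves
        else:
            test_ids ^= {i}                # revert worsening moves
    return test_ids

examples = [{"speaker": f"s{i % 7}", "text": f"utt-{i}"} for i in range(100)]
held_out_speakers = coordinate_ascent(examples, key="speaker")
held_out_utterances = coordinate_ascent(examples, key="text")
```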
-
Over the past decade, researchers have started to explore the use of NLP to develop tools aimed at helping the public, vendors, and regulators analyze disclosures made in privacy policies. With the introduction of new privacy regulations, the language of privacy policies is also evolving, and disclosures made by the same organization are not always the same in different languages, especially when used to communicate with users who fall under different jurisdictions. This work explores the use of language technologies to capture and analyze these differences at scale. We introduce an annotation scheme designed to capture the nuances of two new landmark privacy regulations, namely the EU's GDPR and California's CCPA/CPRA. We then introduce the first bilingual corpus of mobile app privacy policies, consisting of 64 privacy policies in English (292K words) and 91 privacy policies in German (478K words), with manual annotations for 8K and 19K fine-grained data practices, respectively. The annotations are used to develop computational methods that can automatically extract "disclosures" from privacy policies. Analysis of a subset of 59 "semi-parallel" policies reveals differences that can be attributed to different regulatory regimes, suggesting that systematic analysis of policies using automated language technologies is indeed a worthwhile endeavor.
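One common way to operationalize "automatically extract disclosures" is sentence-level multi-label classification over data-practice categories. The sketch below uses a generic tf-idf plus one-vs-rest logistic regression baseline with an invented toy label set; it is a minimal illustration, not the paper's model or annotation scheme.

```python
# Minimal sketch of disclosure extraction as sentence-level multi-label
# classification, assuming scikit-learn. The label set and sentences are
# invented toys, far coarser than the paper's 8K/19K annotations.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.multiclass import OneVsRestClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import MultiLabelBinarizer

sentences = [
    "We share your email address with advertising partners.",
    "Location data is collected to provide the service.",
    "You may request deletion of your account data at any time.",
    "We retain logs for twelve months.",
]
labels = [("third-party-sharing",), ("first-party-collection",),
          ("user-rights",), ("data-retention",)]

mlb = MultiLabelBinarizer()
y = mlb.fit_transform(labels)
clf = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)),
                    OneVsRestClassifier(LogisticRegression(max_iter=1000)))
clf.fit(sentences, y)

pred = clf.predict(["Your phone number may be disclosed to our partners."])
# With realistic training data, ideally [('third-party-sharing',)]:
print(mlb.inverse_transform(pred))
```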
-
Privacy policies are long and complex documents that are difficult for users to read and understand, and yet they have legal effects on how user data is collected, managed, and used. Ideally, we would like to empower users to inform themselves about issues that matter to them, and enable them to selectively explore those issues. We present PRIVACYQA, a corpus consisting of 1750 questions about the privacy policies of mobile applications, and over 3500 expert annotations of relevant answers. We observe that a strong neural baseline underperforms human performance by almost 0.3 F1 on PRIVACYQA, suggesting considerable room for improvement in future systems. Further, we use this dataset to shed light on challenges to question answerability, with domain-general implications for any question answering system. PRIVACYQA offers a challenging corpus for question answering, with genuine real-world utility.
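The reported F1 gap is easiest to interpret if answers are treated as sets of selected policy sentences. Below is a minimal scoring sketch; the convention of scoring a prediction against its best-matching expert annotation is a common QA evaluation practice and is assumed here, not necessarily the corpus's exact protocol.

```python
# Sketch of sentence-level F1 for answerability-style evaluation,
# assuming answers are sets of selected sentence indices. Scoring
# against the best-matching expert annotator is an assumption.

def f1(pred: set, gold: set) -> float:
    if not pred and not gold:
        return 1.0           # both sides say "unanswerable"
    if not pred or not gold:
        return 0.0
    tp = len(pred & gold)
    if tp == 0:
        return 0.0
    precision = tp / len(pred)
    recall = tp / len(gold)
    return 2 * precision * recall / (precision + recall)

def score(pred: set, annotations: list) -> float:
    """Score a prediction against its best-matching expert annotation."""
    return max(f1(pred, gold) for gold in annotations)

annotations = [{3, 4}, {4}, {3, 4, 5}]   # sentence indices per expert
print(score({4}, annotations))           # 1.0 against the second expert
```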